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TIME-SLOT ASSIGNMENT PROCESS 




This invention relates to a time-slot assignment process for use in a contention 
5 resolution scheme for switches and switch arrangements in a communications networlc. 
In particular, but not exclusively, the invention relates to an aggregated channel 
assignment technique for multi-stage buffering and switching in cell/packet switches and 
switching arrangements in a communications network. 

1 0 The aggregated channel assignment solution is suitable for use with scalable matching 
algorithm which matches asymmetric request matrices for switches and switch 
arrangements as described in our co-pending patent application filed herewith whose 
contents are hereby incorporated by reference, entitled "MULTI-STAGE- MATCHING 
PROCESS". 

15 

This invention complements the multi-stage matching invention described in the^ 
inventors co-pending patent application entitled "MULTISTAGE 
MATCHINGPROCESS", filed herewith as both inventions provide means for reducing > 
computing steps in frame-based scheduling. Nonetheless, it is not a requirement that:'' 
20 this invention be used in conjunction with multi-stage matching, merely an option. 

Single stage switch/network fabrics are well known in the art. Time-slot assignment 
of cell or packet requests between the input and output ports of switches and 
networks, such as is shown schematically in Figure 1 , is conventionally performed in 

25 a single stage. The number of input and output ports is taken here to be LN. If the 
switch fabric is a single-stage switch matrix, whether Implemented as a space switch 
or as a wavelength switch, and cells or packets are switched between the input and 
output ports in a frame with a duration of F fixed-length time slots, then the time 
slots can be chosen using a path-search algorithm in a conventional three-stage TST 

30 switching network. Each input and output port of the 3~stage logical switch 
represents a time slot occupied by an individual cell or packet. There, are LNE such 
input and output ports. 



When each "connection" path between the LNF input and output ports across the 3- 
stage switch is implemented sequentially, the computing time (steps) is OILNF^). If 
we assume, for example, that the computing time allowed for time-slot assignment is 
as much as a full frame of F time slots, i.e. R seconds, this requires a single 
5 processor with a clock speed 0{LNFVFt) = 0(LNF/t) Hz. For a 10 Tbit/s ceil/packet 
switch with LN= 1024 ports at B = 10 Gblt/s line rate, F = LN time slots per frame and 
T = 50 nsec, this single processor would need a clock speed of 20 THz, which is 
twice the capacity of the switch it is controlling. A single sequential processor at 20 
THz is impossible. More precisely, the ratio of total processing capacity to switch 
1 0 capacity is 

ProcessaigCapacity _ 0{LNFIr) _Jf\J F . , . .eqtn. 1 

SwitchCapaOty ~ LNB l^rsj \S0Q) 
which is a ratio of 1 024/500 = 2 for this example. 

The inventors earlier patent specification WO 01/67802 entitled "Packet Switching" 
15 describes a doubly parallel implementation of the path-search algorithm, in order to 
reduce the computing steps to 0(LNIog2F). This employed LN parallel processors, 
each one of which contained a compact superconcentrator for finding and seizing in 
parallel the required, number of free, common time slots between the occupancy 
vectors of a pair of input and output ports. Each of the LN parallel processors must 
20 have a clock speed of at least QdNlogaF/Fx) Hz. In the same example, this is 200 
MHz, which In itself is quite acceptable. But each compact superconcentrator 
possesses an omega network with 0{F) ports. So the total "throughput" of all the 
compact superconcentrators is 0{LNFx2o6) MHz = 0(200 THz). This is 20 times the 
capacity of the switch being controlled, which is absurd. More precisely: 



YrocessorThroughput I ) _Jl^}^^ii^^ AHD2Ml\ 

'^^ SwitchCapacity LNB { zB ) \^ 500 J 

which Is a ratio of 20 for this example. . 



.eqtn. 2 



A large reduction in processing capacity can be achieved by using only a single level 
of parallelism from WO 01/67802. LN port processors would still operate in parallel, 
30 each one finding the required number of free, common time slots between a pair of 
input and output ports, but the time slots themselves would be found by sequential 



3 . 



inspection. All input-output port pairs are processed after LN steps of parallel 
processing. There are therefore LN processors, each taking up to O(LNF) computing 
steps altogether. Hence 



5 This ratio is around 2 again, as in eqtnl . The processing capacity is still greater than 
the switch capacity being controlled. Furthermore, each sequential processor must 
have a clock speed 0(LN/t) = 20 GHz, which is out of the question today. 



10 across a single-stage switch fabric, using existing time-slot assignment algorithms 
and circuits, are too large. Without improved algorithms and circuits, multi-stage 
switch fabrics must be employed. 

Although multi-stage switches and networks must now be considered as a means of ' 
15 reducing processing complexity, it is insufficient simply to change the single-stages 
switch matrix of Figure 1 into a 3-stage space switch fabric. Together with the out# 
time switches, the structure becomes a logical, 5-stage TSSST switch. Of course ift 
agile wavelength channels are being used to implement the space switching, this can?' 
be a logical, 5-stage TXSXT switch, for example. However, all the processing 
20 requirements described above would still be needed to assign the time slots across 
the 3-stage space switch. In addition, further processing would be needed to 
establish the paths through the 3-stage space switch fabric in every time slot. 

Simply applying the same number of time slots to a multi-stage switch fabric in this 
25 way is of no benefit in reducing the processing complexities needed for channel 
assignment. 





For such large switches, the processing capacities required to assign time slots 



30 



The invention seeks to mitigate and/or obviate the disadvantages of known time-slot 
assignment techniques by providing a multi-stage time-slot assignment process for a 
scheduling process in which: 




4 . 

i) time slots are aggregated from multiple ports to provide a common pool of 
time-space channels; and 

ii) the T*, 3"*, 5* and optionally 7* stages provide time-slot interchange 
capability, as decried herein below. 

5 

A first aspect of the invention seeks to provide a multi-stage time-slot assignment 
process for an input-queued switch arrangement in a communications network, the 
switch comprising a plurality of N ingress elements and N egress elements, each of 
the ingress elements having a number L of ingress subelements and each of the 

10 egress elements having a plurality L of egress subelements, the switch arrangement 
being arranged to have L or more real middle stage space switches of size N x N, and 
having F or more time-slots, the time-slot assignment process between the said 
ingress subelements and egress sub-elements comprising the steps of: aggregating F 
time slots from each of a plurality L in number of said ingress subelements to form an 

15 ingress element having a plurality LF or more in number of time-space channels which 
are pooled between the L subelements of each ingress element and the L 
subelements of each egress element, wherein each time-space channel corresponds 
to a different logical middle-stage switch of the packet switch arrangement so that 
the number of logical input elements and logical output elements for which channel 

20 assignment is performed through the middle stage of the switch is N; performing 
time-space channel assignment through the middle stage of the switch between the 
logical input elements and the logical output elements; providing time-slot 
interchange capabilities at the T^, S"*, 5* stages; and performing time-slot 
assignment between the ingress subelements of the ingress elements and the logical 

25 output ports of the 3"* stage switches, and performing time-slot interchange between 
the logical input ports of the 5* stage switches and the egress subelements of the 
egress elements. 

The total number of ingress sub-elements in each element may not equal the number 
30 of egress subelements in each element, providing the total number of inqress 
subelements for the switch equals the total number of egress subelements for the 
switch. 



5 

The switch arrangement may be arranged to switch cells or packets, 
the switch arrangement may be a circuit switch. 



Alternatively, 



The switch arrangement may a cell switch arrangement capable of switching 
5 packets. 

The time-slot assignment process according to the first aspect may further comprise 
a 7^^ stage providing a time-slot interchange capability. 

10 The sub-elements may comprise ports, and the elements comprise aggregations of 
ports. 

The switch arrangement may be an input-queued switch comprising VOQs which are 
implemented in random access memory RAM (for example, - when the switch 
1 5 arrangement is arranged to switch cells or packets). >. 

The time-slot assignment process according to tlie first aspect may furtlier comprise 
the recursive decomposition of the three stages In each element into seven stages 
using the steps according to the first aspect. 4^ 

20 

A second aspect of the invention seeks to provide a switch arrangement in which 
time-slots are assigned by a time-slot assignment process according to the first 
aspect. 

25 A third aspect of the invention seelcs to provide a network having a switch 
arrangement according to the second aspect. 

A fourth aspect of the invention seeks to provide a suite of at least one computer 
program arranged to implement the time-slot assignment process according to the firs 
30 aspect. 

The suite of at least one computer program may be at least partly arranged to be 
Implemented in hardware. The suite of at least one computer program may be at 
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least partly arranged to be implemented in software. Thus it is possible in some 
embodiments of the invention to implement the time-slot assignment process using a 
combination of hardware components and software. 

5 Those skilled in the art will appreciate that the preferred features of the invention as 
defined by the dependent claims may be combined with each other where appropriate 
any suitable manner apparent to those skilled in the art and with any of the aspects 
where appropriate and in any suitable manner apparent to those skilled in the art. 

10 The embodiments of the invention will now be described by way of example with 
reference to the accompanying drawings in which: 

Figure 1 shows schematically input and output ports of a cell/packet switch or 
cell/packet switching network; 

Figure 2 is a schematic diagram provides a 7-stage switching architecture for 
aggregating the time slots of L ports into a common pool of LF channels; 

Figure 3 shows schematically the Assignment of cells or packets to logical inputs and 
20 outputs of the 7-stage logical switch architecture in an embodiment of the invention; 

Figure 4 shows schematically paths and cell/packet assignments after the first path- 
search phase through the middle-stage switches in the embodiment of the invention 
shown in Figure 3; 

25 

Figure 5 shows schematically paths and cell/packet assignments after the second 
path-search phase through the individual sub-networks in the embodiment of the 
invention shown in Figure 3; 

30 Figure 6 shows schematically paths assigned to each of the 1 6 cells or packets 
through the 7-stage logical switching network in the embodiment of the invention 
shown in Figure 3; and 
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Figure? shows schematically re-ordering of cells or packets to prevent mis- 
sequencing when the y^^'-stage time-slot interchangers are left out in the embodiment 
of the invention shown in Figure 3. 

5 

The best mode of the invention as currently contemplated by the inventors will now 
be described with reference to the accompanying drawings. Those skilled in the art 
will recognise that the detailed description provided of specific embodiments of the 
invention below is not intended to represent the only means of implementing the 
10 invention,, and that any suitable means which is apparent to a person skilled in the art 
may be used where appropriate to implement any specific feature of the invention as 
described. 

This invention relates to the channel assignment part of a frame-based scheduling 
15 algorithm. However, it is also widely applicable to the general problems of time-slot 
and channel assignment in switches. The technique provides a means for reducing 

the computing steps required to implement the parallel Clos path-searching algorithm, 

. if 

as applied to the time-slot assignment of cells and packets in frame-based schedulinjg 
(and as described by the inventors in International PCT patent application \NO 

20 01/67802 entitled "PACKET SWITCHING", the contents of which are hereby 
incorporated by reference). This is achieved by defining a larger set of channels than 
there are time slots in the frame and selecting (path-searching) those channels 
through the middle stage of a multi-stage switching architecture. The outer stages of 
this architecture are themselves then recursively expanded into 3-stage switches and 

25 their time slots chosen (path-searched). The result of this is to require time-slot 
interchangers to. be inserted as additional switching stages within a multi-stage 
switching fabric. 

This principle enables channel assignment to be performed within and between 
30 smaller switches or sub-networks, hence reducing the computing complexity. The 
principle can be applied to sub-networks in the form of individual smaller crossbars, 
individual smaller cell/packet switches, individual rings and individual PONs. When 
this principle is applied to multi-stage fabrics of individual smaller crossbars, the 
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switching fabric (i.e. tlie space switches), excluding the additional time-slot 
interchangers, remains the same as would be required without this technique, i.e. 
there is no speed-up required of the space switch fabric to implement the technique, 
just 'additional buffers/time-slot interchangers. These could of course be Implemented 
5 in the form of additional cell/packet switch linecards. If both contiguity and no mis- 
sequencing of cells/packets from the same VOQ are required at the output of the 
cell/packet switch, then 3 additional stages of time-slot interchangers (buffers) are 
necessary. But if only no mis-sequencing is required, allowing cells/packets from the 
same VOQ to exit the cell/packet switch with cells/packets from other VOQs 
10 interspersed between them, then only 2 additional stages of time-slot interchangers 
(buffers) are required. These requirements compare favourably with the 2-stage 
Birkhoff-von Neumann switch, which requires two space switch fabrics and 1 
additional stage of buffers, and even more favourably with the parallel packet switch 
architecture, which can require a fabric speed-up of 3. 

15 

. Although this description applies decomposition only once to the sub-networks, 
turning them into 3-stage TST switches and the initial 3-stage logical switch into a 7- 
stage logical switch, the principle could of course be applied recursively again to the 
resulting 3-stageTST switches within each sub-network, turning them into 7-stage 
20 logical switches and the initial 3-stage logical switch into a 15-stage logical switch, 
and so on. 

The inventors have found that in order to reduce the processing complexity needed 
for channel assignment, the number of channels to be assigned is increased. This is 

25 achieved by grouping together L ports into sub-groups or sub-networks, so that their 
time slots are aggregated into a common pool of LF time-space channels. See Figure 
1 of the accompanying drawings. Each channel corresponds to a different middle- 
stage space switch (both real and logical). There are L real middle-stage space 
switches, each of size NxN and each of which exists in F time slots. This aggregation 

30 technique, has the effect of reducing the number of logical ports for which channel 
assignment is performed through the middle stage of the switch to just N. Since the 
number of computing steps (see eqtns.1-3) is generally related to the number of 
ports, this reduces the processing complexity. 



9 



Figure 2 is a 7-stage switching architecture. The sub-groups or sub-networl<s of ports 
are themselves recursively decomposed into 3-stage switches. This is necessary 
because two cells or packets sent or received by any of the LN real switch ports may 
5 be assigned to the same time slot through different middle-stage switches. Since real 
ports have only one transmitter and receiver, they cannot transmit or receive more 
than one cell or packet in the same time slot. Their cells or packets must be 
transmitted in different time slots within the sub-group or sub-network of ports to 
other, different real ports (or linecards), one of which could be the same as the 

10 transmitting one, where they must be time-slot interchanged as required. The 3-stage 
sub-networks therefore consist in an LxL space switch (or wavelength switch,, for 
example) which is configured in F different time slots. Thus the 1", 3"", 5"^ and 7* 
stages of the T-stage architecture are time-slot interchangers. The switches in thed** 
and 7* stages represent the real ports or linecards of the overall switch. The 1'*-stage 

15 time-slot interchangers in reality represent the forwarding of each cell or packet from 
the correct VOQ in the correct time slot. 

Of course there is nothing new about the recursive construction that decomposes .^he 
outer switches of a 3-stage switch into 3-stage switches themselves, resulting iij 7 
20 stages altogether. This was evident from the earliest work of Clos to the explicit 
statement in United States patent number US 5,801,641. 

The invention provides a time-slot assignment process for a scheduling process in 
which: 

25 

iii) time slots are aggregated from multiple ports to provide a common pool of 
time-space channels; and 

iv) the 1", 3'^ 5* and optionally 7* stages provide time-slot interchange 

capability. 

30 

The 7* stage does not need to be implemented in certain circumstances. 
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Time-Slot interchangers in tlie 1" stage represent the forwarding of each cell or 
packet from their positions in their VOQs into the switch fabric in the correct time 
slot. 

Before channel assignment can be implemented, it is first necessary to assign the 
LNF inputs and LNF outputs of the network to the cells or packets to be switched in 
the frame. This does not need to be done for every cell or packet individually. 
Instead, on the input side of the switch, every real port or linecard would assign a 
number of contiguous inputs (i.e. time slots) to the au successfully accepted requests 
of each of its LN VOQs, i.e. from' VOQ ai.i t6 ai,LN, obtained from a previous matching 
phase. Similarly, on the output side, every real port or linecard j would assign a 
number of contiguous outputs to the ay successfully accepted requests from each of 
the LN inputs, i.e. from VOQ aij to aiNj. 

If contiguity of outgoing cells or packets from the same VOQ is required, then all 4 
stages of time-slot interchanger are required. But if contiguity of outgoing cells or 
packets from the same VOQ is not required, the last stage of time-slot interchange 
can be left out; i.e. the real output ports would have no buffers and cells or packets 
would leave the switch in the time slots assigned to them within the sub-group or 
sub-network (stages 5 to 7). However, this could lead to mis-sequencing of 
cells/packets from the same VOQ. Mis-sequencing can be prevented very simply by 

a) implementing the VOQs in RAM buffers Instead of separate FIFO buffers 
and 

• re-ordering the time slots in which particular cells or packets within 
the same VOQ are transmitted from the real input ports (linecards), 
in the 1" stage; i.e. cells or packets would not exit a VOQ in the 
time order of their position in the VOQ (no longer FIFO) 

• not implementing the 1 "-stage time-slot interchangers 

OR 

b) implementing the VOQs in FIFO buffers and 

• reading the cells or packets from their VOQs in FIFO order 
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• re-ordering the time slots in which the cells or packets are 
forwarded into the switch using 1^-stage time-slot interchangers 



Path-searching (i.e. time-space channel assignment) is performed first through the 

5 middle-stage (4^^) switches, by assigning specific. middle-stage switches (channels) to 

each sub-group or sub-network. Secondly, each individual sub-group or sub-network 

is path-searched (time-slot assigned). Although path-searching algorithms for 

rearrangeably non-blocking switching networks could be employed, we will assume, 

for example, that just a single level of parallelism from the parallel path-search 

o o I / '^o 

10 algorithm for 3-stage (Clos) networks from IPD Caoo A25962 is employed to assign 
channels and time slots. Port processors would operate in parallel, each one finding 
the required number of free, common channels between a pair of input and output 
ports, but the channels themselves would be found by sequential inspection. ^ 

15 First Path-Search Through iVliddle-Stage Switches 

The first path-search through the middle-stage switches (assignment of LF time-space 

channels) would require N processors, each taking O(NLF) computing steps. Eaci^. of 

the processors must therefore have a clock speed 0(NLF/Fi:) = 0(NL/t) Hz, which is 

20 Gbit/s in our example. This is too high for today's sequential processors. 

20 Nevertheless, the ratio of processing capacity to switch capacity is 



which is around a ratio of 0.064. The reduction from LN real ports to N sub-groups or 
sub-networks provides a proportional reduction in processing capacity compared with 
eqtn.3. The aggregation technique is therefore extremely promising, but another 

25 degree of parallelism is required to reduce the computing steps, without increasing 
the number of processors too much. To do this, let us employ n separate processors 
to perform sequential searching for free, common channels, each one searching 
sequentially through only 0(LF/n) of the 0(LF) channels. Each one would count the 
number of free channels that it finds. Then the individual counts would be added 

30 sequentially (e.g. by one of the n processors) in 0(n) steps. When the sum reaches or 
exceeds the required number of channels, the adding can stop. The channels found 
by all the processors before the one that reached or exceeded the required number 
can be seized and assigned to both the relevant, paired sub-groups or sub-networks. 




r 
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Alternatively, if they haven't yet been assigned, these processors can search again, 
assign and seize. The processor that reached or exceeded the required number can 
then search again, stopping when it has found, assigned and seized the last required 
channel. For this level of parallelism In the first path search: 




...eqt»i . 5 



f. Vioces^gCapacity Ft In n J _JN{^^ n_ ^^J^jH-— 

° SwitchCapacity LNB \tB\^ LF )) ' [sOOl LF )) 

Because this ratio Increases as the square of the number n of processors performing 
the sequential search in each port "processor", the number n should be as small as 
possible. Let us assume that a maximum processor clock speed of 2 GHz Is available, 
hence 

10 ■ • of-^^.«.^)] = 2 GHZ ...eqtn. 6 



for which n is 21 in our example. This would lead to a processing/swrtching ratio 



ProcessmgOgwcay _ J 2.01N^ _ J JfA ...eqtn. 7 

SwitchCapacity K 500 j 1,250 j 

Thus, in our example, the penalty for reducing the processor speeds from 20 GHz to 
a practical value of 2 GHz is merely to double the processing capacity requirement. 
1 5 The resulting ratio for this first path search Is just 0.1 28. 

Second Path-Search Through Individual Sub-Networks 

Path-searching (time-slot assignment) is next performed within each individual sub- 
group or sub-network. There are 2N sub-groups or sub-networks, each of which is 

20 path-searched in parallel. Furthermore, each sub-network is path-searched using L 
port processors in parallel, using the first '^^'Q°^^f^''|'!^'^?zf^ parallel path- 
search algorithm for 3-stage Clos networks in IPD Cuac A2606& . Each one of these L 
port processors is assumed to search sequentially through 0(F) time slots. (Of course 
blocking may require somewhere between F and 2F-1 time slots to be used). Hence 

25 the total number of processors is 2NL, each taking 0(LF) computing steps. The 
processor clock speed required is 0{LF/Ft) = 0(L/x) = 640 Mbit/s. which Is 
practicable. The processing/switching capacity ratio is 

( 2NI?F ^ 
"Pi ocessim gCapacity _ \ -^^ 



^2L^_L_ ...eqtn. 8 

SwitchCapacity LNB iB 250 

which is the same value 0.1 28 in our example. 



30 
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Thus from eqtns.7 and 8 the total ratio of processing capacity to switch capacity is 
0.256. For a 10 Tbit/s switch this is an acceptable 2.56 Tbit/s. 



Alternatively, the individual sub-groups or sub-networks could be path-searched using 
5 Andresen's algorithm (for example, see Steinar Andresen, "The looping algorithm 
extended to base 2* rearrangeable switching networks," IEEE Trans. On Comms., Vol. 
COM-25, No. 10, 1057-1063 (1977), the text of which is hereby incorporated by 
reference), which uses a mapping between rearrangeably non-blocking Benes 
networks, which use multiple stages of 2x2 switches, and 3-stage Clos networks. 

10 This could result in an even lower processing capacity requirement. Andresen's 
algorithm can be implemented sequentially or using parallel processors. In practice, 
there is likely to be an optimum number of parallel processors that should be 
employed in order to reduce the processor speed to an acceptable value while 
achieving an acceptable total processing capacity. As this is a rearrangeably nori- 

1 5 blocking algorithm, it guarantees that no more than F time slots are required within 
the sub-groups or sub-networks. 

A Worked Example 

To demonstrate the operation of the aggregated channel assignment technique^in 
20 multi-stage switch fabrics, an example 4x4 cell or packet switch with L = 2 ports per 
sub-group or sub-network, N = 2 sub-networks, LN=4 ports (input and output) and 



frame duration F = 4 time slots will be used. The traffic matrix for this example is 
taken from the results of the multi-stage matching algorithm in JPD Coco A^OV^-I.. 
using the modified NOB25 pointer up-date rule which is deterministic for input ports 
25 but non-deterministic for output ports. The matrix of accepted requests for the next 
frame is 

"0 0 13' 
r /• -^1 0 2 2 0 

[a2(x,D)] =12 10 

3 0 0 1. 

Each matrix element a2{i,j) represents the number of cells or packets to be taken from 
each of the VOQs {i,j) at the 4 real input ports (linecards) and switched across the 
30 fabric in the next frame to the 4 real output ports (linecards). The logical 7-stage 
switch architecture shown in Figure 3 has 4 logical input and output ports for each 



.eqtai. 9 
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real input and output port, i.e. 1 6 logical input and output ports altogether. Each 
of these represents an individual cell or packet to be switched across the fabric. 



Assigning Cells or Packets to Input and Output Ports 

5 

Figure 3 shows schematically the result of assigning the 16 logical inputs and 16 
logical outputs of the network to the cells or packets. This is not done for every cell 
or packet individually. Instead, on the input side of the switch, every real port or 
linecard i assigns a number aij of contiguous logical inputs (i.e. time slots) to the aij 

10 successfully accepted requests of its 4 VOQs. Similarly, on the: output- side, every 
real port or linecard j assigns a number aij of contiguous logical outputs to the atj 
successfully accepted requests from each of the 4 VOQs destined for it. The cells or 
packets are identified by three numbers. The first two give the identity of the cell's or 
packet's VOQ (i,j), the third number in brackets is the specific identity of the cell or 

15 packet in its VOQ. For convenience the position of the cell or packet within its VOQ 
is used as its specific identity. For example, the HOL packet in VOQ 1,3 is designated 
1,3(1). The contiguous cells or packets in the same VOQ could be assigned to the 
logical input and output ports in any order. However, for convenience, they are 
assigned in the increasing order of their positions in their VOQ. 

20 

First Path-Search Through Middle-Stage Switches 

The first path search requires N = 2 processors. The parallel path-search algorithm for 

(/OO 0\ IC'7<SrQ'2^ . , . ^ .. . 

3-stage switches from IPD Cuby. A2D0G2 requires the two left-hand-side sub-. 

25 networks and the two right-hand-side sub-networks to be taken in pairs. Each pair is 
processed by one of the 2 processors. This can be performed simultaneously, in 
parallel. For example, pairs (Sihsl, Srb.1) and (S.hs2, Srhs2) could be processed first. 
When these have been path-searched, the pairings are changed and path-searching 
undertaken for the next set of sub-network pairs. For example, one of the sub- 

30 networks in the pairings could be Incremented by 1 (cyclic pairing). Of course in the 
present switch example, there is only one other set of pairings possible; (Sihsl , Srhs2) 
and (Sih82, Srt«.1). Each processor adds up the total number of accepted requests in 
the VOQs between the pair of sub-networks it is dealing with. From eqtn.9 the 
reduced matrix of accepted request numbers between the sub-networks becomes 
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[a(Sihai,Sthsj) ] = 




. .eqtn. 10 



Each processor then finds the first a(Sihsi,Srhsj) free, common middle-stage switches 
.5 (channels) that have not already been seized, between the first pair of sub-networl<s. 
The search begins from the uppemnost, first middle-stage switch. This method is 
traditional "call packing". Each processor then proceeds to repeat the path-search for 
its second pairing of sub-networks. The assignment of a middle-stage switch to a 
particular pair of sub-networks also decides the path through th^ middle-stage 
10 switch. After all the accepted requests have been assigned a middle-stage switch 
(channel), the processors can assign specific cells or packets to those middle-stage 
switches. For convenience, cells or packets within the same VOQ are assigned tp the 
middle-stage switches in the increasing order of their positions within the VOQ. The 
result of all this is to guarantee that the sequence of cells or packets within the same 
1 5 VOQ is preserved in the sequence of middle-stage swjtches (channels) assigned to 
them. The paths and cell/packet assignments after the first path-search pjiase 
through the middle-stage switches (channels) are shown in Figure 4. LF= 2x^=8 
middle-stage switches (channels) are sufficient; none of the 16 "connections^ is 
blocked. 

20 

Second Path-Search Through individua l Sub-Networks 

Although the individual sub-networks could either be path-searched using parallel 
path-searching for 3-stage Clos networks or using a rearrangeably non-blocking 
algorithm such as the looping algorithm or Andresen's algorithm, the paths shown in 
25 Figure 5 have been established using the parallel path-search algorithm. This is 
performed in a similar fashion to the first path search. 

To establish the paths, a reduced matrix of numbers of "connections" [c(S)] is first 
calculated between the and 3'"- stage switches of each sub-network from the 
30 cell/packet assignments on either side of the sub-network, as follows: 

[c(Si^l)] 2] • . • 

[c(Sa^2)] = g 12 




[c(Srhsl)] 



[0(Srhs2)] 




. .eqtn. 13 



. .eqtn. 14 



These serve a similar role for each sub-network to that of eqtn. 10 in the first path 
search. The identities of the specific cells or packets are then assigned to the middle- 
5 stage switches of each sub-network (i.e. the 2"^ and 6* stages of the 7-stage logical 
network). 

The cell/packet assignments are now sufficient throughout the network to be able to 
• complete the paths" through the 1"-, 3""-, 5*- and 7^-stage switch matrices-.- Rgure 6 
10 of the accompanying drawings shows the complete paths assigned to every cell or 
packet. All cells or packets from the same VOQ are in sequence and contiguous at 
the logical outputs of the 7"^ stage, which means that they will be in sequence and 
contiguous when they are transmitted to line. 

1 5 However, if the 7**' logical stage of time-slot interchangers is left out, so that cells or 
packets can go straight out to line without having to be buffered at the real output 
ports (linecards), then cell/packet 3,2(1) and 3,2(2) will become out of sequence and 
lose contiguity, as will cells/packets 1,4(1), 1,4(2) and 1,4(3). Although loss of 
contiguity may be acceptable, mis-sequencing may not be. Fortunately, mis- 

20 sequencing can easily be prevented, without requiring any additional re-sequencing 
buffers. The method of prevention is purely algorithmic. At each 7*-stage logical 
switch, the order in which cells/packets within the same VOQ appear at the input 
ports of the switch can be used to re-order the time sequence in which the 
cells/packets must be forwarded from their VOQs in the 1'* stage. For example, the 

25 cells/packets in VOQ 1,4 appear in the order 1,4(2), 1,4(3), 1,4(1) on the input ports 
of 7*-stage logical switch 4. The paths taken through the network by these 
cells/packets, which start at the logical output ports of the 1»*-stage switch 1, must 
be mapped to cells/packets at the logical input ports of the 1»*-stage switch 1 whose 
identities correspond to this order; i.e. 1,4(2)-»1,4(1), 1,4(3)^1,4(2) and 

30 1,4(1)-)- 1,4(3). The connections across 1"-stage switch 1 are rearranged to provide 
this mapping, which of course re-orders the time sequence in which the cells/packets 
will be transmitted from the 1 "-stage logical switch 1. Figure 7 shows the re-ordered 
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cell/packet path assignments for preventing mis-sequencing when the 7*-stage time- 
slot interchangers are left out; i.e. when there is no buffering in the real output ports 
(linecards) before transmission to line. Cells/packets 1,4(1), 1,4(2). 1,4(3), 3,2(1) and 
3,2(2) have been re-ordered at the logical outputs of the 1^-stage time-slot interchangers 
5 1 and 3. 

Hence the HOL cell/packet 1,4(1) will be transmitted after cell/packet 1,4(3), which 
is behind it in VOQ 1,4, and before cell/packet 1,4(2). HOL cell/packet 3,2(1) is also 
transmitted after cell/packet 3,2(2) in 1^*-stage logical switch 3. If the VOQs are 

10 implemented as FIFO buffers, then the 1^^-stage switches must be implemented as ■ 
time-slot interchangers so that the sequence of cells/packets can be re-ordered. But If 
the VOQs can be implemented as RAM buffers instead of FIFOs, then the re-ordering 
can be performed in the process of reading out the cells/packets from the RAM 
buffer, so that no additional time-slot interchangers are required in the 1^^ logical 

15 stage.. Evidently the y^'^-stage switches must be included logically in the path- 
searching algorithm, in order to determine the correct sequence in which cells or 
packets must be forwarded from their VOQs to prevent mis-sequencing, but they;;are 
not physically implemented. ^ 

■u 

20 Those skilled in the art will appreciate that the number of stages can vary and rhay 
depend on the number of levels of aggregation used. For larger switch arrangements it is, 
possible to have a larger number of buffer stages. The invention can be implemented in 
any suitable form, including software and/or hardware and the software algorithm may be 
provided in a form which is distributed amongst several components. The invention may 

25 be also implemented as a suite of one or more computer programs. 

The invention may be implemented such that the egress subelements are not grouped 
into egress elements in one embodiment of the example, providing the total number of 
egress subelements equal the total number of ingress subelements. Those skilled in the 
30 art will appreciate that the invention can be implemented in a switching arrangement 
where the ingress elements are bidirectional and comprise said egress elements (and 
also where both ingress and egress elements are provided and are each bidirectional). 
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The text of the absfract is hereby deemed incorporated as part of the specification of 
invention. 
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CLAIMS 



1. A multi-stage time-slot assignment process for an input-queued switch 
arrangement in a communications network, the switch comprising a plurality 

5 of N ingress elements and N egress elements, each of the ingress elements 

having a number L of ingress subelements and each of the egress elements 
having a plurality L of egress subelements, the switch arrangement being 
arranged to have L or more real middle stage space switches of size N x N, 
and having F or more time-slots, the time-slot assignment process between 
10 the said ingress subelements and egress sub-elements comprising the steps 

of: 

" aggregating F time slots from each of a plurality L in number of said 
ingress subelements to form an ingress element having a plurality LF or more 
in number of time-space channels which are pooled between the L 

15 subelements of each ingress element and the L subelements of each egress 

element, wherein each time-space channel corresponds to a different logical 
middle-stage switch of the packet switch arrangement so that the nurhber of 
logical input elements and logical output elements for which channel 
assignment is performed through the middle stage of the switch is N; v.;: 

20 performing time-space channel assignment through the middle stage of 

the switch between theS logical input elements and the logical output elements; 
providing time-slot interchange capabilities at the r^ 3"*, 5* stages; 

and 

performing time-slot assignment between the ingress subelements of 
25 the ingress elements and the logical output ports of the 3"* stage switches, 

and performing time-slot interchange between the logical input ports of the 5**^ 
stage switches and the egress subelements of the egress elements. 

2. A time-slot assignment process as claimed in claim 1, wherein the switch 
30 arrangement is arranged to switch cells or packets. 



3. 



A time-slot assignment process as claimed in claim 1, wherein the switch 
arrangement is a circuit switch. 
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4. A time-slot . assignment process as claimed in claim 2, wherein the switch 
arrangement is a cell switch arrangement capable of switching packets. 

5 5. A time-slot assignment process as claimed in any preceding claim, further 
comprising a 7* stage providing a time-slot interchange capability. 

6. A time-slot assignment process as claimed in any preceding claim, wherein the 
subelements comprise ports, and the elements comprise aggregations of ports. 

10 

7. A time-slot assignment process as claimed in any preceding claim, wherein 
the input-queued switch comprises VOQs which are implemented in random 
access memory RAM. 

15 8. A time-slot assignment process as claimed in any preceding claim, further 
comprising recursive decomposition of the three stages in each element into 
seven stages using the steps indicated in claim 1 . 

9. A switch arrangement in which time-slots are assigned by a time-slot 
20 assignment process as claimed in any preceding claim. 

10. A network having a switch arrangement as claimed in claim 9. 

11. A suite of at least one computer program arranged to implement the time- 
25 slot assignment process according to any one of claims 1 to 8. 

12. A suite of at least one computer program as claimed in claim 11, at least 
partly arranged to be implemented in hardware. 

30 13. A suite of at least one computer program as claimed in claim 11 or 12, at 
least partly arranged to be implemented In software. 
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14. A time-slot assignment process substantially as described herein, with 
reference to the accompanying drawings. 
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ABSTRACT 

MULTI-STAGE TIME-SLOT ASSIGNEMENT PROCESS 

A multi-stage time-slot assignment process for an Input-queued switch arrangement 
5 in a communications network, the switch comprising a plurality of N ingress elements 
and N egress elements, each of the ingress elements having a number L of ingress 
subelements and each of the egress elements having a plurality L of egress 
subelements, the switch arrangement being arranged to have L or more real middle 
stage space switches of size N x N, and having F or more time-slots, the time-slot 

10 assignment process between the said ingress subelements and egress subelements 
comprising the steps of: aggregating F time slots from each of a plurality L in number 
of said ingress subelements to form an ingress element having a plurality LF or more 
in number of time-space channels which are pooled between the L subelements of 
each ingress element and the L subelements of each egress element, wherein each 

1 5 time-space channel corresponds to a different logical middle-stage switch of the 
packet switch arrangement so that the number of logical input elements and logical 
output elements for which channel assignment is performed through the middle stage 
of the switch is N; performing time-space channel assignment through the middle 
stage of the switch between the logical input elements and the logical output 

20 elements; providing time-slot interchange capabilities at the 1**, 3'", 5* stages; and 
perfomiing time-slot assignment between the ingress subelements of the ingress 
elements and the logical output ports of the 3"" stage switches, and performing time- 
slot interchange between the logical input ports of the 5* stage switches and the 
egress subelements of the egress elements. 

25 



Figure (3) 
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